speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Quantized Approximate Signal Processing (QASP): Towards Homomorphic Encryption for audio

Add code
May 15, 2025
Viaarxiv icon

A Comparative Analysis of Static Word Embeddings for Hungarian

Add code
May 12, 2025
Viaarxiv icon

Empirical Analysis of Asynchronous Federated Learning on Heterogeneous Devices: Efficiency, Fairness, and Privacy Trade-offs

Add code
May 11, 2025
Viaarxiv icon

Teochew-Wild: The First In-the-wild Teochew Dataset with Orthographic Annotations

Add code
May 08, 2025
Viaarxiv icon

Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement

Add code
May 07, 2025
Viaarxiv icon

SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer

Add code
May 07, 2025
Viaarxiv icon

Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech

Add code
May 06, 2025
Viaarxiv icon

CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization

Add code
May 06, 2025
Viaarxiv icon

SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation

Add code
May 06, 2025
Viaarxiv icon

VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

Add code
May 06, 2025
Viaarxiv icon